An Upper Asymptote for the Three-Parameter Logistic Item-Response Model

Mark  A.  Barton 
and 

Frederic  M.  Lord 

Abstract 

An  upper-asymptote  parameter  was  added  to  the  three-parameter 
logistic  item  response  model.  This  four-parameter  model  was  compared 
to  the  three-parameter  model  on  four  data  sets.  The  fourth  parameter 
increased  the  likelihood  in  only  two  of  the  four  sets.  Ability 
estimates  for  the  students  were  generally  unchanged  by  the  introduction 
of  the  fourth  parameter. 


An  Upper  Asymptote  for  the  Three-Parameter  Logistic  Item-Response  Model* 

Mark  A.  Barton 
and 

Frederic  M.  Lord 

A  two-parameter  logistic  item-response  model  may  be  expressed  as 

$$F(\theta) = \left(1 + e^{-1.7 a_g (\theta - b_g)}\right)^{-1} ,$$

where $F(\theta)$ is the probability of a student with ability $\theta$ passing
item $g$. The probability of passing ranges from zero to one as $\theta$
goes from $-\infty$ to $\infty$.

On  a  multiple  choice  test,  however,  the  probability  of  choosing  the 
correct  answer  does  not  approach  zero  for  low-ability  students.  For  this 
reason,  a  lower  asymptote  is  introduced.  The  resulting  three-parameter 
model  is 

$$G(\theta) = c + (1 - c)F(\theta) ,$$

where  the  lower  asymptote,  sometimes  referred  to  as  a  guessing  parameter, 
may  be  set  arbitrarily;  or  a  common  value  may  be  estimated  from  the  data; 
or  individual  values  may  be  estimated  from  the  data  separately  for  each 
item. 

An  upper  asymptote  may  likewise  be  introduced  into  the  model: 

$$P(\theta) = c + (\delta - c)F(\theta) .$$

While $F(\theta)$ ranges from zero to one, $P(\theta)$ ranges from the lower
asymptote, $c$, to the upper asymptote, $\delta$.
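The three nested models above can be written as one function. The following is a minimal Python sketch, not code from LOGIST; the function names `logistic_2pl` and `irf_4pl` and the example parameter values are assumptions made for illustration.

```python
import math


def logistic_2pl(theta, a, b, D=1.7):
    """Two-parameter logistic F(theta) = 1 / (1 + exp(-D * a * (theta - b)))."""
    z = D * a * (theta - b)
    # Branch on the sign of z so that exp() never overflows.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def irf_4pl(theta, a, b, c=0.0, delta=1.0, D=1.7):
    """P(theta) = c + (delta - c) * F(theta).

    delta = 1 gives the three-parameter model; delta = 1 and c = 0
    give the two-parameter model.
    """
    return c + (delta - c) * logistic_2pl(theta, a, b, D)


if __name__ == "__main__":
    # Hypothetical item: a = 1.0, b = 0.0, c = 0.2, upper asymptote 0.99.
    for theta in (-3.0, 0.0, 3.0):
        print(theta, round(irf_4pl(theta, a=1.0, b=0.0, c=0.2, delta=0.99), 4))
```

At the item difficulty the logistic term equals one half, so the probability sits midway between the two asymptotes.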

*This work was supported in part by contract N00014-80-C-0402,
project  designation  NR150-453  between  the  Office  of  Naval  Research  and 
Educational  Testing  Service.  Reproduction  in  whole  or  in  part  is 
permitted  for  any  purpose  of  the  United  States  Government. 
Even a high-ability student may make a clerical error in answering
an  easy  item.  The  introduction  of  an  upper  asymptote  with  a  value  of 
slightly  less  than  1  should  allow  a  high-ability  student  to  miss  an 
easy  item  without  having  his  ability  estimate  drastically  lowered. 
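The size of this penalty can be illustrated by the log-likelihood contribution of a single wrong answer, ln Q, where Q = 1 - P. The sketch below uses a hypothetical easy item (a = 1.0, b = -2.0, c = 0.2) and a hypothetical student with ability 3.0; these values and the helper name `miss_penalty` are assumptions, not figures from the study.

```python
import math


def irf_4pl(theta, a, b, c, delta, D=1.7):
    """P(theta) = c + (delta - c) / (1 + exp(-D * a * (theta - b)))."""
    z = D * a * (theta - b)
    if z >= 0:
        F = 1.0 / (1.0 + math.exp(-z))
    else:
        F = math.exp(z) / (1.0 + math.exp(z))
    return c + (delta - c) * F


def miss_penalty(theta, a, b, c, delta):
    """Log-likelihood contribution ln Q(theta) of one wrong answer."""
    return math.log(1.0 - irf_4pl(theta, a, b, c, delta))


if __name__ == "__main__":
    # Hypothetical easy item and high-ability student (illustrative values).
    for delta in (1.00, 0.99, 0.98):
        print(delta, round(miss_penalty(3.0, 1.0, -2.0, 0.2, delta), 3))
```

Because F never exceeds one, Q is at least 1 - delta, so with an upper asymptote below 1 the penalty for a miss can never fall below ln(1 - delta). With an upper asymptote of exactly 1 it is unbounded as ability increases.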

This study of the upper-asymptote model was motivated largely by
the  concern  that  the  three-parameter  model  might  be  severely  penalizing 
high-ability  students  who  make  a  clerical  error  on  an  easy  item.  The 
three-parameter  model  was  compared  to  the  upper-asymptote  (four- 
parameter)  model  for  four  data  sets:  SAT  Verbal,  SAT  Math,  GRE  Verbal, 
and  AP  Calculus  AB. 

The  tests,  all  of  which  were  developed  by  Educational  Testing 
Service,  are  described  briefly: 

Scholastic  Aptitude  Test  (SAT),  Verbal:  The  SAT  is  a  75-minute, 
90-item  verbal  test  designed  for  applicants  to  undergraduate  study. 

All  items  are  five  choice. 

Scholastic  Aptitude  Test  (SAT),  Math:  The  mathematical  part  of 
the  SAT  (from  a  different  form  of  the  test,  with  different  students) 
was  analyzed  as  a  separate  data  set.  This  90-minute  test  has  85  items, 
20  of  which  are  four  choice,  and  65  of  which  are  five  choice. 

Graduate  Record  Examination  (GRE)  Aptitude  Test,  Verbal:  This 
test  of  verbal  reasoning  and  reading  comprehension  is  designed  for 
applicants  to  graduate  study.  All  of  the  items  on  this  50-minute, 
80-item  test  are  five-choice  items. 


College  Board  Advanced  Placement  (AP)  Examination — Mathematics: 
Calculus  AB:  This  is  a  test  taken  by  high  school  students  in  order 
to  receive  college  credit  for  their  knowledge  of  elementary  functions 
and  first-semester  calculus.  The  essay  items  were  not  included  in  this 
analysis. The objective portion lasts 90 minutes and has 45 items.

All  of  the  items  are  five  choice. 

For  each  of  the  four  data  sets,  the  following  procedure  was  used: 

1)  Ability  and  item  parameters  were  estimated  under  the 
three-parameter  model  using  the  program  LOGIST  (Wood  &  Lord, 
1976; Wood, Wingersky, & Lord, 1976), with several thousand
students.  (These  runs  were  already  available.) 

2)  In  order  to  reduce  costs,  a  random  subsample  of 
1000  students  was  chosen. 

3) With the $c_g$ parameters (guessing parameters) held
fixed at the previously estimated values (to reduce
costs), the remaining item parameters ($a$'s and $b$'s)
and the abilities ($\theta$'s) were reestimated for just the
1000 students with $\delta = 1.00$.

4) With the $c_g$ parameters still held fixed, the $a$'s, $b$'s,
and $\theta$'s were estimated with upper asymptote values of
$\delta = .99$ and $\delta = .98$, using a modified version of
LOGIST on the sample of 1000 students.


The  effect  of  the  upper  asymptote  was  measured  in  two  ways. 

First,  likelihoods  were  computed  for  each  value  of  the  upper 
asymptote ($\delta = 1.00$, $\delta = .99$, $\delta = .98$), and the asymptote value
with the highest likelihood was considered the best fit. In two of
the cases (GRE Verbal and AP Calculus AB) the highest likelihood was
obtained with $\delta = 1.00$ (the standard three-parameter model) and in
the other two cases (SAT Math and SAT Verbal) the highest likelihood
was obtained with $\delta = .99$. In none of the four data sets did the
highest  and  the  lowest  of  the  three  log-likelihoods  differ  by  more  than 
0.02%  (more  than  .0002). 
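The likelihood comparison can be sketched as follows. This is a simplified illustration rather than the LOGIST procedure: it evaluates the log likelihood at fixed, hypothetical parameter values, whereas the study reestimated the item parameters and abilities under each upper-asymptote value. The names `log_likelihood` and `best_delta` are assumptions.

```python
import math


def irf_4pl(theta, a, b, c, delta, D=1.7):
    """P(theta) = c + (delta - c) / (1 + exp(-D * a * (theta - b)))."""
    z = D * a * (theta - b)
    if z >= 0:
        F = 1.0 / (1.0 + math.exp(-z))
    else:
        F = math.exp(z) / (1.0 + math.exp(z))
    return c + (delta - c) * F


def log_likelihood(responses, thetas, items, delta):
    """Sum of u * ln P + (1 - u) * ln Q over all students and items.

    responses[j][i] is 1 (right) or 0 (wrong); items[i] is (a, b, c).
    """
    total = 0.0
    for u_row, theta in zip(responses, thetas):
        for u, (a, b, c) in zip(u_row, items):
            p = irf_4pl(theta, a, b, c, delta)
            total += math.log(p) if u == 1 else math.log(1.0 - p)
    return total


def best_delta(responses, thetas, items, candidates=(1.00, 0.99, 0.98)):
    """Return the best-fitting candidate, all scores, and the relative spread."""
    scores = {d: log_likelihood(responses, thetas, items, d) for d in candidates}
    best = max(scores, key=scores.get)
    highest, lowest = max(scores.values()), min(scores.values())
    spread = (highest - lowest) / abs(highest)  # relative difference
    return best, scores, spread


if __name__ == "__main__":
    # Hypothetical toy data: two students, two items.
    items = [(1.0, -2.0, 0.2), (1.0, 0.5, 0.2)]
    print(best_delta([[0, 1], [1, 0]], [3.0, 0.0], items))
```

The `spread` value corresponds to the relative difference between the highest and lowest log likelihoods reported in the text.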

Second, ability estimates under the two asymptote values $\delta = 1.00$
and $\delta = .99$ were compared to see whether the change in
upper-asymptote value had a large effect on the ability estimate of any
individual  student.  Out  of  the  4000  students  examined  in  the  four 
data  sets,  the  greatest  change  was  an  increase  of  .64  under  the  .99 
upper-asymptote  model.  The  five  students  most  affected  are  summarized 
in  Table  1.  Two  of  these  were  very  high-ability  students,  and  the  other 
three  were  very  low-ability  students.  The  ability  estimate  changed 
by  more  than  the  standard  error  of  estimate  in  only  one  of  the  five 
cases. In all five cases, the .99-upper-asymptote model favored the
student.  The  largest  drop  in  a  student’s  ability  estimate  under  the 
.99-upper-asymptote  model  was  .36,  for  a  student  of  very  low  ability 
on  the  GRE  Verbal.  All  1000  students  who  took  the  GRE  are  plotted 
Table 1

The Five Students with the Largest Absolute Changes
in $\hat\theta$ out of 4000 Examined

                                                            Standard Error
                                                            of Measurement
$\hat\theta_{\delta=1.00}$   $\hat\theta_{\delta=.99}$   $\hat\theta_{\delta=.99} - \hat\theta_{\delta=1.00}$   of $\hat\theta_{1.00}$   Test

    2.9       3.4       0.5       0.4       GRE Verbal
    2.8       3.3       0.5       0.7       AP Calculus AB
   -3.9      -3.3       0.6       1.1       SAT Verbal
   -5.0      -4.4       0.6       2.7       GRE Verbal
   -5.9      -5.4       0.5       4.8       GRE Verbal

Thetas  have  a  mean  of  0  and  a  standard  deviation  of  1. 


in Figure 1; in the plots for the other three data sets, points were
even closer to the identity line.
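The per-student comparison of ability estimates can be sketched in the same spirit. The code below estimates each ability by a simple grid search over the interval from -6 to 6 with item parameters held fixed; this is a stand-in chosen for brevity, not the estimation method used by LOGIST, and the function names and example items are assumptions.

```python
import math


def irf_4pl(theta, a, b, c, delta, D=1.7):
    """P(theta) = c + (delta - c) / (1 + exp(-D * a * (theta - b)))."""
    z = D * a * (theta - b)
    if z >= 0:
        F = 1.0 / (1.0 + math.exp(-z))
    else:
        F = math.exp(z) / (1.0 + math.exp(z))
    return c + (delta - c) * F


def theta_mle(u_row, items, delta, lo=-6.0, hi=6.0, steps=1201):
    """Grid-search maximum-likelihood ability estimate for one student."""
    best_theta, best_ll = lo, -math.inf
    for k in range(steps):
        theta = lo + (hi - lo) * k / (steps - 1)
        ll = 0.0
        for u, (a, b, c) in zip(u_row, items):
            p = irf_4pl(theta, a, b, c, delta)
            ll += math.log(p) if u == 1 else math.log(1.0 - p)
        if ll > best_ll:
            best_theta, best_ll = theta, ll
    return best_theta


def largest_changes(responses, items, top=5):
    """Students with the largest absolute change between delta = 1.00 and .99.

    Returns tuples (absolute change, student index, estimate at 1.00,
    estimate at .99), largest change first.
    """
    changes = []
    for j, u_row in enumerate(responses):
        t100 = theta_mle(u_row, items, 1.00)
        t99 = theta_mle(u_row, items, 0.99)
        changes.append((abs(t99 - t100), j, t100, t99))
    return sorted(changes, reverse=True)[:top]


if __name__ == "__main__":
    # Hypothetical three-item test and three response patterns.
    items = [(1.0, -1.0, 0.2), (1.0, 0.0, 0.2), (1.0, 1.0, 0.2)]
    print(largest_changes([[1, 1, 1], [0, 0, 0], [1, 1, 0]], items, top=2))
```

Response patterns that are all right or all wrong have no finite maximum, so the grid search returns the boundary of the interval for them under either asymptote value.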

One  might  expect  more  clerical  errors  to  be  made  on  a  highly 
speeded  test.  This  might  result  in  the  .99-upper-asymptote  model  fitting 
better  than  the  1.00-upper-asymptote  model.  The  only  highly  speeded 
test  of  the  four,  the  GRE  Verbal,  was  fit  best  with  the  1.00-upper- 
asymptote  model,  however,  discounting  this  speededness  hypothesis. 

In  view  of  the  failure  of  the  four-parameter  model  either  to 
consistently  improve  the  likelihood  or  to  significantly  change  any 
ability estimates, there is no compelling reason to urge the use of this
model.  The  extra  computational  time  required  for  the  more  complex 
derivatives  further  argues  against  its  use. 

One  positive  result  of  this  study  is  that  it  suggests  that  the 
three-parameter  logistic  model  does  a  better  job  of  fitting  high- 
ability  students  than  some  researchers  (including  the  authors)  had 
expected.  Had  we  examined  the  normal-ogive  model  instead,  which  is 
less  forgiving  of  incorrect  answers  to  easy  items  by  high-ability 
students,  we  would  possibly  have  found  a  stronger  case  for  the 
adoption  of  an  upper  asymptote  of  .99  or  .98. 


Appendix


Formulas  for  derivatives  of  the  four-parameter  model*: 
n = number of items.

N = number of subjects.

D = 1.7.

$\delta$ = upper asymptote.

$u_{ij}$ = observed response (1 if right, 0 if wrong) for item $i$, subject $j$.

$$L_{ij} = a_i(\theta_j - b_i)$$

$$P_{ij} = c_i + (\delta - c_i)/(1 + e^{-DL_{ij}})$$

$$Q_{ij} = 1 - P_{ij}$$

The log likelihood is given by

$$\sum_{i=1}^{n} \sum_{j=1}^{N} \left[ u_{ij} \ln P_{ij} + (1 - u_{ij}) \ln Q_{ij} \right]$$

(Lord, p. 58)
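As a worked check on these definitions, the first derivative of the log likelihood with respect to an ability can be obtained by the chain rule. This is a sketch derived from the formulas given here, not a reproduction of the authors' printed expressions. The symbols $\ell$ (the log likelihood) and $F_{ij}$ (the logistic term) are introduced only for this illustration, and the upper asymptote is written $\delta$.

```latex
% Assumed shorthand: F_{ij} = (1 + e^{-D L_{ij}})^{-1}, so P_{ij} = c_i + (\delta - c_i) F_{ij}.
\begin{align*}
\frac{\partial F_{ij}}{\partial \theta_j}
  &= D a_i F_{ij} (1 - F_{ij}) , \\
\frac{\partial P_{ij}}{\partial \theta_j}
  &= (\delta - c_i) D a_i F_{ij} (1 - F_{ij})
   = \frac{D a_i (P_{ij} - c_i)(\delta - P_{ij})}{\delta - c_i} , \\
\frac{\partial \ell}{\partial \theta_j}
  &= \sum_{i=1}^{n} \frac{u_{ij} - P_{ij}}{P_{ij} Q_{ij}}
     \frac{\partial P_{ij}}{\partial \theta_j}
   = D \sum_{i=1}^{n}
     \frac{a_i (u_{ij} - P_{ij})(P_{ij} - c_i)(\delta - P_{ij})}
          {(\delta - c_i) P_{ij} Q_{ij}} .
\end{align*}
```

The second form of the middle line uses $F_{ij} = (P_{ij} - c_i)/(\delta - c_i)$ and $1 - F_{ij} = (\delta - P_{ij})/(\delta - c_i)$. Setting $\delta = 1$ makes $\delta - P_{ij} = Q_{ij}$, which cancels the $Q_{ij}$ in the denominator and gives the three-parameter case.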


Theta  derivatives: 
*The authors are grateful to Kirsten Yocom for verifying all of
the  derivatives. 
Item-parameter derivatives:


We  will  drop  subscripts  for  brevity  in  the  following  formulas. 

If $x$ represents $a_i$, $b_i$, or $c_i$, and $\psi$ represents $a_i$,
$b_i$, or $c_i$, then


l  (u  -  P) 

J-l 


c  - > 

K  3x  ; 


N 

=  r  [ 

j=i 


'  3*  M  3x  ' 


+  (u  -  P)[ 